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ABSTRACT 

We study a sample of ~10'^ galaxy clusters in the redshift range 0.2<2:<0.8 with masses M2oo>5x lO^®/iyg^M0, discovered 
in the second Red-sequence Cluster Survey (RCS2). The depth and excellent image quality of the RCS2 enabled us to 
detect the cluster-mass cross-correlation up to 2^0.7. To obtain cluster masses, concentrations, and halo biases, we fit 
a cluster halo model simultaneously to the lensing signal and to the projected density profile of red-sequence cluster 
members, because the latter provides tight constraints on the cluster miscentring distribution. We parametrised the 
mass-richness relation as M200 — Ax {N 200 /2Q)°‘ and find A — (15.0 ± 0.8) x Mq and a — 0.73 ± 0.07 at low 

redshift (0.2<2<0.35). At intermediate redshift (0.35<2<0.55), we find a higher normalisation, which points towards a 
fractional increase in the richness towards lower redshift caused by the build-up of the red sequence. The miscentring 
distribution is well constrained. Only ~30% of our BCGs coincide with the peak of the dark matter distribution. The 
distribution of the remaining BCGs are modelled with a 2D-Gaussian, whose width increases from 0.2 to 0.4 h^g^Mpc 
towards higher masses. The ratio of width and r2oo is constant with mass and has an average value of 0.44 ±0.01. The 
mass-concentration and mass-bias relations agree fairly well with literature results at low redshift, but have a higher 
normalisation at higher redshifts, possibly because of selection and projection effects. The concentration of the satellite 
distribution decreases with mass and is correlated to the concentration of the halo. 

Key words, gravitational lensing - dark matter haloes 


1. Introduction 

Observations of galaxy clusters provide a wealth of as- 
trophysical and cosmological information. A key quantity 
of clusters is the mass, because it determines the relative 
importance of various processes such as AGN feedback. 
Furthermore, given a large sample of cluster masses, the 
cluster mass function can be determined and compared to 
simulations in order to constrain cosmological parameters, 
such as the normalisation of the matter power sp ectrum, 
P's, and the cosmologic al matter density, Dm (e.g. lEvrardI 
ll989l:IWhite et al1ll993D : if the redshift baseline of the sam¬ 
ple is sufficiently large, the dark energy equation of state 
can be constrained (e.g. IVoitll2005l : lAllen et al.ll201lll . 

The mass of a cluster is not a direct observable, but 
can be determined with a variety of techniques. The ve¬ 
locity distribution of cluster member s have been used to 
deriv e dynamical mass e stimates (e.g. Ivan der Marel et ^ 
I2OOOI: iLokas et al.l l2006l) . but these observations are gen¬ 
erally expensive since they require spectroscopic observa¬ 
tions of many cluster members. Additionally, assumptions 
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on the satellite orbits are needed to convert the veloc¬ 
ity dispersions into a mass estimate. X-rays observables 
can a lso be used to estimate the mass (see lEttori et al.l 
I2OI3I for a review), under the assumption that the hot 
cluster gas is in hydrostat ical equilibrium. The results of 
iMahdavi et al.l (120081120131) support the results f rom hydro- 
dynamical simulations (e.g. iNelson et alll2014ll that clus¬ 
ters are generally not in hydrostatical equilibrium, which 
biases the X-ray-based mass estimates. Another powerful 
method of obtaining cluster masses is weak gravitational 
lensing. 

In weak lensing, the distortion of the images of faint 
background galaxies (sources) due to the gravitational po¬ 
tentials of intervening structures (lenses) is measured. This 
signal is proportional to the excess surface mass density, 
which can be modelled to obtain the mass. Weak lensing 
does not rely on direct tracers of the potential; the distor¬ 
tion can be measured for any lens, out to large radii where 
no visible tracers can be used. Additionally, the weak lens¬ 
ing signal does not depend on the physical state of the mat¬ 
ter in the clusters, and no assumptions have to be made 
(e.g. virial equilibrium) to measure the total projected 
mass. Weak lensing has been used to determine the mass of 
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individual massive low-redshift clusters (e.g. Hoekstral2007t 

Okabe et al. 201C 

: Hoekstra et al. 20121: AoDleffate et al. 

2014 

Gruen et al. 

20131: lUmetsu et al. 12014 iKettula et al. 

2014 

Hoekstra et 

al.1 2 OI 5 II, as well as the average mass of 


samples of cluste rs and galaxy groups by stacking their lens- 


ing s i gnals ('e.g. [Mandelbaum et al 


l2009t Covone et alJl20l4 iFord et al. l2015ir 


2006al : ISheldon et alJ 


From the weak-lensing signal, many cluster properties 
can be extracted, such as the cluster mass, concentration, 
halo bias, and miscentring distribution. The relation be¬ 
tween these parameters can help constrain models of cluster 
physics. To determine the mass function, however, we need 
mass estimates of a large number of clusters. The lensing 
signal of all but the most massive clusters is generally noisy; 
only by stacking the signal of samples of clusters can the 
average mass be robustly constrained. A common solution 
is to determine how an observable cluster property scales 
with mass and can serve as a ma ss proxy. The Sunyaev - 
Zeldovich effect has been used le.g. lWilliamson et al.ll201lll 
and appears particularly useful for estimating the masses of 
massive clusters at high redshifts through scaling relations. 
Another observable property is the richness, which has the 
advantage that it can be determined from the same multi¬ 
colour imaging data that is used for the lensing analysis. 

To determine the richness of a cluster, it is neces¬ 
sary to distinguish cluster galaxies from fore- and back¬ 
ground galaxies and, if necessary, correct for contamination. 
Cluster members can be identified if their redshift or ve¬ 
locity dispersions are available, which requires either spec¬ 
troscopy or observations in many bands for reliable pho¬ 
tometric redshifts. Alternatively, cluster members can be 
identified using their colours as the majority of early-type 
galaxies in a cluster populate a narrow range in colour- 
magnit ude space, that is, the E/SO ridge line or the red se¬ 
quence (iGladders fc Yeell2000[l . The advantage of the latter 
is that observations in only two bands suffice, which makes 
it cheap and particularly suited for the aut omated detection 
of clu sters in large imaging surveys (e.g. IGladders fc Yeel 
I 2 OO 5 II . Additionally, only a few field galaxies reside in this 
regime of colour-magnitude space, which reduces the con¬ 
tamination. 

In this paper we use optical imaging dat a from the sec¬ 
ond Red-sequence Gluster Survey (RGS2; iGilbank et al.l 
I 2 OIIII . both to detect clusters and determine their rich¬ 
ness, as well as to measure their lensing signals. The sur¬ 
vey design was chosen so as to optimise the detection of 
a large number of clu sters using a red-sequence method 
(|Gladders fc Yeel[200flh . In total, ^10"^ clusters have been 
detected in the RGS2, spread over a wide range in optical 
richness, with redshifts 0.2<2:<0.9. In contrast, the redshif t 
range of the maxBCG cluster sample (jKoester et al.l[2007ll , 
a catalogue of 13823 clusters th a.t has been detec ted in the 
Sloan Digital Sky Survey fSDSS: lYork et al1l2000[l , covers a 
redshift range of0.1<z<0.3, which limits its use for evolu¬ 
tionary studies. The redshift range of clusters in the RCS2, 
combined with the excellent lensing quality of the data, 
makes the RCS2 well suited to this purpose. 

The outline is as follows. In Sect. [2j we present the var¬ 
ious steps of the analysis: we discuss the cluster detection 
and richness estimates (EH), provide details of the lens¬ 
ing measurement (12.211 . and discuss our halo model EH- 
We highlight our novel approach of including the cluster- 
satellite correlation to constrain the miscentring distribu¬ 
tion in Sect. 12.3T] We present the mass-richness relation in 


Sect. O the cluster miscentring distribution in Sect. the 
mass-concentration relation in Sect. [5l the satellite distri¬ 
bution in Sect. [Bland the mass-halo bias relation in Sect. [71 
We conclude in Sect. E Throughout the pa per we assume a 
WMAP7 cosmology Komatsu et al.l[201Ill with ag = 0.8, 
Da = 0.73, Dm = 0.27, Db = 0.046 and h = 0.7 the di¬ 
mensionless Hubble parameter. All distances quoted are 
in physical (rather than comoving) units unless explicitly 
stated otherwise. 


2. Analysis 


The RCS2 is a nearly 900 square degree imaging 
survey in three bands , F and £) carried out with the 
Canada-France-Hawaii Telescope (CFHT) using the one 
square degree field of view camera MegaCam. The primary 
imaging data covers 740 square degrees of sky, divided in 
13 patches. The survey area can be split into 145 blocks 
of contiguous non-overlapping 2x2 degrees of sky, which 
we use to estimate the bootstrap covariance matrices of 
our measurements as discussed later on. Hence in total we 
use 580 square degrees. The lensing analysis is performed 
on the eight minute exposures in the F-band (rj'jj^~24.3), 
which is best suited for lensing with a median seeing of 
0.71". 


The pho tometric calibration o f the RCS2 is described 
in detail in IGilb ank et aTI (|201lll . the lensing analysis in 
Ivan Uitert et al.l (|201lh . For details, we refer the reader 
to these works. In short, we measured t he shapes of 
2.2x 1 0^ galaxies using the KSB method ( Kaiser^^alJ 
I 1995 I : iLuppino fc Kaise3ll997l: iHoekstra et al.l 19981 200011 . 
which corresponds to a source number density of 6.3 
arcmin~^. Two ma jor improvemen t s to t h e lensing analysis 
were i ntroduced in iGacciato et al.l ([201411 ; Ivan Uitert et al.l 
(l20T^: we used the photometric redshift catalogues 
from ilbert et al. l (I 2 OI 3 II instead of the catalogues from 
lllbert et al.l (1200^ to estimate the source redshift distri¬ 
bution; and secondly, we introduced a correction scheme 
to account for a multiplicative bias in our KSB method 
(due t o noise bias (iKacprzak et al.ll2012l : iMelchior fc Violal 
I 2 OI 2 I : iRefregier et al.l l2012ll . an d galaxy blends), w hich 
affects the lensing measurement ([Hoekstra et al.ir2015ll . 


2.1. Cluster detection 


Galaxies clusters are identi fied using a mo dihed version of 


the algorithm presented in Lu et al.l ( 

20Q9I1. This is a sim- 

plified version of the IGladders & Yee 

(200511 red-sequence 


cluster-finding method. The main modifications are de¬ 
scribed in Gilbank et al. (in prep.), detailing how the 
method was applied to RCS2 data. Briefly, the significance 
of a candidate cluster overdensity is determined from a 
count of (colour-selected) g:alaxies within a c ircle w ith a ra¬ 
dius of 0.5 Mpc. Unlike th ^Gladders fc YH (|2005H method, 
no magnitude weighting is applied to the galaxies, and the 
uncertainties in the cluster and field counts are estimated 
directly from Poisson statistics. Colour slices are built at 
regular colour intervals (which leads to irregular bins in 
photometric redshift), and each slice overlaps its neighbour 
by a quarter of the slice width. For RCS2, the method is 
identical except that the model colour-redshift relation was 
transformed to the RCS2 filters. One additional modifica¬ 
tion for RCS2 involves the centring adopted, which aims 
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Fig. 1. Redshift versus logiQ(A^ 20 o), the logarithm of the 
number of early-type cluster members brighter than M* +1 
inside t^qq ■ The clusters cover a wide range in richness and 
redshift, and are therefore very well suited to studying the 
redshift dependence of the mass-richness relation. 


to locate the BCG via a simple two-step approach, as dis¬ 
cussed in Appendix [A] 

Richnesses are est imated using an appr o ach similar to 
the one ou tlined in iKoes ter et ah l (l2007li . iHansen et al.l 
(|2005l [ 20 ?)^ ■ and I.Tohnston et al.l (|20n7li for the maxBCG 
cluster sample. Firstly, the number of red-sequence galax¬ 
ies brighter than M* -|- 1 within an aperture of radius one 
Mpc is counted. M* is estimated from si mple stellar pop - 
ulation synthesis models, as described in iLu et ^ (l2009li . 
This count is then used to estimate r 2 oo (the radius inside 
of which the density is 200 times the critical density, pc) for 
each cluster using the relation r|gQ = O.lSOA^g,;^® Mpc from 
iHansen et al.l (|2005li . The number of red-sequence candi¬ 
dates brighter than M* -|-1 within gives iV 2 oo, our rich¬ 
ness estimate. Since we also apply a background subtrac¬ 
tion, N 2 Q 0 are non-integer values. 

The maxBCG papers listed above use a variety of 
slightly different scaling relations which vary in both ex¬ 
ponent and normalisation depending on precisely how r 2 oo 
wa s derived. The relatio n we choose is similar to that given 
in IHansen et al.l (|2005li . although their radius is quoted 
in h^QpMpc and here we do not correct for the different 
cosmology, given the range of values in the normalisation 
from method-to-method. In practise, a different normalisa¬ 
tion should just lead to counting within a different fraction 
of r 2 oo- Therefore, mass-richness estimates within a single 
cluster sample should still be internally self-consistent, and 
to compare between samples it should suffice to apply a 
constant systematic scaling. 

The distribution of cluster redshifts and richnesses are 
shown in Fig.jTJ To assess the accuracy of the ‘red-sequence’ 
redshifts, we match the BGGs to the galaxies with spectro¬ 
scopic redshifts from t he ninth data release of the SDSS 
(DR9; lAhn et ^ l2012f) . In total, we find 2212 matching 
galaxies. We compare the redshifts in Fig. [5] and find that 



Fig. 2. Gomparison of our ‘red-sequence’ redshifts to the 
spectroscopic redshifts from SDSS. 


they agree quite well. Note that the stripes of ‘red-sequence’ 
redshifts reflect the redshifts slices used to detect the clus¬ 
ters. Only at redshifts below 2<0.3 and above z>0.7, the 
‘red-sequence’ redshifts are slightly overestimated. The av¬ 
erage difference in each redshift slice is shown in Fig. [31 
At low redshift, the bias is likely the result of only using 
red-sequence templates up to 2:=0.248; at lower redshifts, 
the colours become degenerate. The cause of the bias at 
2 ;>0.7 is less obvious, but seems to indicate that the ini¬ 
tial calibration of the red-sequence templates using spec¬ 
troscopic data was affected by outliers (either mismatches 
or matches that sampled regions with poorer than average 
photometry). We examined these clusters in detail and de¬ 
cided there was no obvious reason to reject them from the 
analysis. We correct the cluster redshifts for this bias by 
htting a third-order polynomial to these matching galaxies, 
and apply the same shift to all our clusters. The richnesses 
are recomputed using the corrected redshifts. Note that Fig. 
[T]shows the corrected redshifts and the corrected richnesses, 
the quantities used in this work. 

After correcting the redshift bias, we compute the scat¬ 
ter and show it in the lower panel of the same figure. When 
we remove obvious outliers using | 2 rcs 2 — .^sdssI > 0.15, 
which are likely mismatches between the photometric clus¬ 
ter and the galaxy with spectroscopy, the scatter has a value 
of ~0.03 and does not vary much with redshift. Note that 
for ‘red-sequence’ redshifts larger than 0.8, the bias and 
scatter cannot be well determined because of the low num¬ 
ber of matches. For this work that is not important, as we 
restrict our analysis to clusters with 2 : <0.8. 

To evaluate our richness estimates, we match our clus¬ 
ters to the maxBCG catalogue. We find 114 matches and 
compare the N 200 estimates in Fig. The richnesses of 
clusters in the RGS2 appear systematically larger at the 
high-richness end. This may partly be attributed to an im¬ 
proved deblending in the RGS2 because of the better ob- 
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0.2 0.4 0.6 0.8 

z(RCS2) 


Fig. 3. Mean difference between our ‘red-sequence’ red- 
shifts and the spectroscopic redshifts from SDSS {top). 
Scatter between the redshifts after accounting for the 
bias {bottom). Open symbols indicate the scatter for all 
matches, solid ones are obtained after removing the out¬ 
liers (|zrcS2 — 4sdss| > 0.15). The dashed/dotted-dashed 
line shows the mean scatter including/excluding outliers. 
We correct the mean redshift bias in our analysis. 


serving conditions. The scatter between the estimates is 
quite large, part of which may be attributed to particular 
settings in the cluster detection algorithm that deal with 
mergers and projections on the sky. 

Not all detections in the cluster catalogue are real clus¬ 
ters: a fraction of the clusters may actually correspond to a 
chance projection of galaxies rather than to a real cluster. 
These false detections have presumably a different lensing 
mass than the real clusters of that richness, but since both 
the richness and mass are affected simultaneously, the bias 
on the scaling relation is expected to be small. The frac¬ 
tion of real clusters is called the purity, which is generally 
a function of richness and redshift, but also depends on the 
cluster detection algorithm. Therefore, to determine the ac¬ 
tual value of the purity for our cluster sample, we need to 
apply the detection algorithm to mock data that mimic the 
RCS2, which has not yet been done. The false positives do 
not add random noise, but a coherent (but likely lower) 
lensing signal. How large the impact is on the lensing mass, 
needs to be addressed with simulations. Note that the de¬ 
tection significance in our cluster finder is high, favouring 
a high purity over a high completeness. 

Figure [T] shows that our cluster sample is incomplete at 
the low richness end. In principle, this should not affect our 
results, as the average lensing signal only becomes noisier 
if we miss clusters in our sample. However, to use the clus¬ 
ter sample to constrain cosmological parameters, a detailed 
knowledge of the completeness function is a prerequisite. 



Fig. 4. Comparison of the cluster richnesses of 114 matched 
clusters from maxBCG and RCS2. The solid line shows the 
one-to-one relationship. 


2.2. Lensing measurement 

The shapes of source galaxies are distorted by the 
gravitational potentials of clusters. In the regime where 
the surface mass density is sufficiently small, the gravita¬ 
tional shear can be approximated by averaging the elliptic- 
ities of source galaxi es (for a review of cluster lensing, see 
iHoekstra et al. 1201311 . We determine the tangential compo¬ 
nent of the shear in radial bins centred on the BCGs, 


{lt){r) 


AS(r) 

Scrit 


( 1 ) 


which is related to the surface density contrast, 
AE(r) =E(<r)-E(r), 


( 2 ) 


the difference between the mean projected surface density 
enclosed by r and the mean projected surface density at a 
radius r. The shear signal at small scales around massive 
clusters is so large that it no longer can be approximated 
as being linearly related to the galaxy ellipticities. We ac¬ 
counted for this when we computed the models (see Section 
12.31) . Ecrit is the critical surface density: 

with D\, Dg and Hr the angular diameter distance to the 
lens, the sou rce, and between the len s and the source, 
respectively (jBartelmann fc Schneider! l200lll . Since we 
lack redshifts for the sources, we select galaxies with 
22<mr'<24: that have a reliable shape estimate (el¬ 
lipticities smaller than one, no SExtractor flag raised) 
as our source sample. To determine their approximate 
redshift distribution, we apply identical magnitude cuts 
to the publicly available photometric redshift catalogue 
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of lllbert etlill (|2011{ll . We use this redshift distribution 
to compute the mean lensing efficiency, (Dis/Ds), as a 
function of lens redshift, accounting for the changing 
average weight of the sources as a function of apparent 
magnitude, and accounting for the impact of outliers in 
th e catalogue. This pro cedure is outlined in Appendix C 
of ICacciato et al.l (^2014^. _ 

In Appendix B of van Uitert et alJ (I2015I1 . we present 
a test of the robustness of our measurement algorithm. In 
short, we measure the lensing signal using different source 
magnitude cuts, and for each cut we recompute the lensing 
signal (after deriving new lensing efficiencies, random 
signals, noise bias corrections and source galaxy contam¬ 
ination corrections). Both at low and high redshifts, the 
resulting lensing measurements are consistent, suggesting 
that this method of measuring tangential shear is robust. 

Here we present results from an additional test. Since 
we only select galaxies with a flux radius that is at least 
10% larger than the local PSF size, it is possible that 
we systematically remove the smallest, highest redshift 
galaxies from our analysis; we do not account for that when 
we compute the lensing efficiencies, which could potentially 
lead to biases. We check that by removing the 10% smallest 
objects (in terms of Vh) from the photo-z catalogues of 
lllbert et all (|201.lll . and recompute the lensing efficiencies. 
At low lens redshifts, the results are practically unchanged. 
At z = 0.7, the highest mean lens redshift that we study 
in this work, the lensing efficiencies only decrease by 3%. 
Removing the smallest 40% of objects (much more than 
we do in practice) leads to a decrease of 8%. Hence the 
effect is not completely negligible, but it is smaller than 
our statistical errors and unlikely to lead to a significant 
bias. 

As mentioned before, the cluster redshifts have been 
corrected for a mean bias. This correction is uncertain 
for redshifts larger than 0.8 due to the low number of 
matches. Hence we restrict ourselves to clusters at 2;<0.8. 
The scatter of the lens redshifts also affects the lensing 
measurement. We estimate the effect in Appendix [B1 and 
find that the impact on the lensing signal is at the per cent 
level. Therefore, it can be safely ignored. 

The distortions induced by weak lensing are much 
smaller than the intrinsic ellipticities of the sources. 
The lensing measurement of a single cluster is therefore 
generally very noisy. To improve the signal-to-noise, the 
lensing signal is stacked for a sample of clusters that have 
similar properties (e.g. within a certain richness range). 
Stacking the lensing signal has the additional advantage 
that the contribution from uncorrelated structures, as well 
as from potential small-scale residual systematics, averages 
out. The lensing signal of the total cluster sample that is 
used in this work is shown in Fig. [5j 

On large scales, residual systematics might affect the 
lensing signal. These systematics are commonly removed 
by measuring the lensing signal around random points and 
subtracting that from the real signal. Such a correction 
could also remove some real signal, hence we do not apply 
this correction, but use it as a test of the quality of our 
catalogues. The mean random signal, averaged over the 145 
non-overlapping blocks of 2x2 degree each, is consistent 
with zero in the radial range that we use in this work. 


2.2.1. Contamination 


A fraction of our source galaxies is physically associ¬ 
ated with the clusters. They are not lensed and therefore 
dilute the lensing signal. We cannot remove them from the 
source sample because we lack redshifts. We could remove 
the bright early-type cluster members using their colours, 
but the faint cluster members cannot be efficiently removed 
because their red sequen ce is not well defi ned, and because 
many of them are blue (jHoekstral l2007fl . Fortunately, we 
can account for the dilution of the lensing signal by mea¬ 
suring the excess source galaxy density around the lenses, 
/cg(r), and boost the lensing signal with I-|-/cg(r). As 
a further precaution, we exclude measurements on scales 
<150 ft-yg^kpc in our analysis. 

This correction implicitly assumes that the satellite 
galaxies are randomly oriented. If the satellites are pref¬ 
erentially radially aligned to the lens, however, the con¬ 
tamination correction may be too low. Most recent stud¬ 
ies i n this field report no significant radial alignment 


3 .g. ISheldon et al 


20091: Hao et al']l2011l: ISchneider et al.l 


120131 : IChisari et ah I20I4I : ISifon et al.f 201511 . although some 


earlier work claimed that such an alignment exists (e.g. 
iPereira k. KuhnI [20051 : iFaltenbacher et aDl2007ll . 

We do not account for the bias in the contamina¬ 
tion correction which results from the blocking of the 
background sky by large ( fore ground) cluster g alaxie s 
(ISimet fc MandelbaumI [ 201 ^ . In van Uitert et al.l l|2015ll . 
we estimate it for LRGs in the RCS2 and find that the cor¬ 
rection is biased by 5% for low-redshift LRGs at a projected 
separation of 50 h^g^kpc. For higher redshifts, and at larger 
separations, the bias quickly decreases, hence it is safe to 
neglect it here. 

Magnification by the clusters can also increase or de¬ 
crease the source density, and hence bias the contamina¬ 
tion correction. The ratio of the lensed and unlensed source 
counts (i.e. the bias) is given by with /r the lens¬ 

ing magnification and a the power law index of the flux 
number density distribution. Using the photometric red¬ 
shift catalogue of lllbert et al.l (|2013h . we find that a = 1.09 
at 22 < r' < 24. To estimate the size of this bias, we assume 
that the mass distribution follows an NFW profile. For a 
cluster at 2 ; = 0.3 with a mass of M 200 = 5 x 10 
(the largest mass we probe), we find a bias of ~5% at 150 

kpc (the smallest lens-source separation we use). For 
lower masses and at larger separations, the bias becomes 
much smaller. This bias is smaller than our statistical er¬ 
rors and we can safely ignore it. 


2.3. Modelling of the signal 

In recent years it has become common practice to 
model the weak-lensing signa l around galaxies and galaxy 
clust ers using the halo model (ISeliakll2000HCoorav fc ShethI 
I 2 OO 2 II . This model provides a statistical framework of the 
distribution of matter in the Universe. Basically, it assumes 
that this distribution can be modelled as a collection of 
separate dark matter haloes in which galaxies and galaxy 
clusters are embedded. The lensing signal comes from the 
haloes that host the galaxy cluster and from neighbouring 
dark matter haloes. 

The h alo models we c r eate are similar to those de¬ 
scribed in iJohnston et al.l l|2007ll . Our model consists of 
four terms, namely the contribution of the BGG, AEbcGi 
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0.1 1.0 10.0 
Distance to lens [h~^ Mpc] 


Fig. 5. Stacked lensing signal (top) and cluster-satellite cor¬ 
relation (bottom) measured for all clusters with N 200 > 2 
and 0.2 < z < 0.8 in the RCS2. The vertical dot-dashed 
lines indicate the fitting range for the cluster halo model. 
Both measurements are shown for illustration only. 


the contribution of the cluster halo, ASnfWj the contri¬ 
bution from miscentred haloes, AEmig, and a term that 
accounts for the con tribution from n eighb ouring haloes, 
AS 2 h- The models of I Johnston et al.l (|2007ll also included 
a correction to account for the fact that the shear is not 
exactly linear. This term is only important on small scales 
for the most massive clusters. Since we only fit scales larger 
than 150 kpc, this term can be safely ignored. 

The lensing signal of the BCG is suffi¬ 
ciently accurately modelled as a point source. 
We model its mass usin g the BCG-h alo mas s 
scaling relation from I Johnston et al.l (l2007ll : 
Mbcg = 1.334 • 1012/(1 -k [M 200 / 6.717 • 1013]-i. 38)^ in 
units of h~^MQ. The contribution of the BCG to the total 
lensing signal is very small on scales > 150h/gi and we 
merely add it for completeness. 

The central dark matter hal o is described by a Na varro- 
Frenk-White prohle (NFW; iNavarro et aLllTOQOll . The 
NFW density profile is given by 


p{r) 


ScPc 

(r/rs)(l -k r/rs)^ ’ 


(4) 


with Sc the characteristic overdensity of the halo, pc 
the critical density for closure of the universe, and 
fs = ?" 20 o/c 20 o the scale radius, with C 200 the concentration 
parameter. The NFW profile is fully specified for a given 
set of (M 2 oo,C 2 oo)j with M 200 the mass inside a sphere of 
radius r 2 oo. We calculate the tangentia l shear profil e using 
the analytical expressions pro vided by iBartelmannI (|1996r) 
and lWright fc BrainerdI (|200(Tll . 

For a fraction of the clusters, the adopted BCG does not 
correspond to the actual centre of the dark matter distribu¬ 
tion. To compute the lensing signal of a miscentred NFW 


profile, we first calculate the miscentred surface mass den¬ 
sity: 


T,{r\rcnm) = -^ j d 6 >ENFw(Y ^’'2 -(- 


"mis - 2rr„ii, cos(6»)) 

(5) 

with rmis the offset from the peak of the dark matter. 
Following I Johnston et al.l l|2007f) . we assume that the mis- 
centring distribution is reasonably well described by a 2D 
Gaussian, 


R(Fmis) = ^^exp {-\{rcn\s/crsf 
crt \ 2 


( 6 ) 


with as the width of the distribution. For a given distribu¬ 
tion, the mean surface mass density is then given by 


s(r') = r 

Jo 


(^mis) ^NFW I ^ mis )■ (7) 


The model lensing signal is computed as usual using 


model lensing signal is c 
s(7’) = Sn^is(<1 t ) Sniis(r) 
t large projected separatic 


At large projected separations, neighbouring clusters 
significantly contribute to the lensing signal. This contri¬ 
bution is modelled as the two-halo t e rm fro m the halo 
model presented in iMandelbanm et al.l (|2006bll . AE 2 h. To 
avoid that neighbouring haloes overlap, we implement the 
halo-e xclusion prescription as detailed in Ivan Uitert et al.l 
(I 2 OIIII . The amplitude of the two-halo term is set by the 
linear bias parameter, which is a free parameter in the 
fit. The main difference with this two-halo term compared 
to other commonly used descriptions (e.g. Cacciato et al.l 
l2009l : lvan Uitert et al.ll20lTI : iLeauthaud et ahl 201211 is that 
it is computed using the linear power spectrum rather than 
the non-linear one. We note that the regime where the one- 
halo and two-halo terms overlap (at a few Mpc) is difficult 
to model in general due to uncertainties in the prescription 
of halo exclusion and non-linear biasing. Marginalizing over 
the linear bias mitigates the impact of these uncertainties 
on the other fit parameters. We have implemented various 
variations of the two-halo term and we will discuss the ef¬ 
fect this has on our results in the forthcoming sections. 

Close to massive clusters, the lensing signal is so large 
that the relation between the source galaxy ellipticities and 
the gravitational shear can no longer be approximated as 
being linear. To account for this, we convert the model 
shear to the reduced shear, which is the quan¬ 

tity we measure observationallv (TSeitz fc Schneider! 119971 : 
iHoekstra et al.l[2?)?)?)l : I Applegate et al. 1201411 : 


AS"®'^ = 


AE" 


□del 


1 - 


lAV Scrit 


( 8 ) 


where 


/3s 


TJis TJoo 
TJs TJioo 


(9) 


with Doo and Dioo the angular diameter distance from the 
observer to a source at infinity, and between the lens and a 
source at infinity. We determine {/3s) and (/J^) by integrat¬ 
ing over the source redshift distribution, as determin e d from 
the photometric redshift catalogues of lllbert et al.l (|201.Jll . 
These reduced shear models are then fitted to the data. All 
lensing signals and model shears that we show in the fol¬ 
lowing sections correspond to the reduced shear. Note that 
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/3s and Scrit in Equation |S] are computed using the mean 
lens redshift for each cluster sample, rather than integrated 
over the lens redshift distribution. We have checked that the 
difference is of the order a few percent, and can be safely 
ignored. 

In short, the model that we fit to the lensing signal is 
given by 

ASniod(?') = AI]bcg(’') +PcASnfw(^) + 

(1-Pc)AEi„is(r)-I-AE2h, (10) 

with Pc the fraction of clusters that is correctly centred. 
Rather than pc we ht q, defined as Pc = 1/[1 + exp(—g)], 
which has an infinite range and can therefore be assigned 
a Gaussian prior. A high value of q corresponds to a low 
miscentring fraction. 


Table 1. Priors on fit parameters 


Parameter 

Prior-mean 

Prior-sigma 

logio(A/' 2 oo) [hjQ Mq] 

14.15 

2.00 

ln(c 2 oo) 

1.39 

3.00 

b 

2.62 

4.00 

In(o-a) Mpc] 

-0.84 

0.70 

q 

0.0 

1.2 

ClnM|N200 

see Table [5] 

0.2 

In(Cgal) 

1.39 

3.00 

Agal[/l70Af^^] 

0.05 

0.1 


2.3.1. Constraining the miscentring distribution 

Figure 6 in l.lohnston et ahl (1200711 shows that the parame¬ 
ters that describe the miscentring distribution are not con¬ 
strained by the lensing measurements. Their results are 
therefore sensitive to the adopted priors. If wrong priors are 
used, other parameters that are degenerate with the mis¬ 
centring parameters may get biased, such as the concentra¬ 
tion. To avoid such complications, we include the cluster- 
satellite correlation in the fit to obtain better constraints 
on the miscentring distribution, bypassing the need for us¬ 
ing informative priors on the miscentring parameters. 

The satellites of a cluster trace the dark matter distri¬ 
bution, although the slope of their radial distribution may 
differ fro m the slope of the projected total mass distribu¬ 
tion (e.g. Watson et al.ll201ft 120121 : iBudzvnski et al.lF2012t 
iTal et al.lllll2ll . Here. we use all red-sequence candidates 
at the cluster redshift brighter than M* -|- 1 as satellites. 
We use this magnitude cut to ensure that our selection is 
homogeneous and complete over the entire redshift range of 
our clusters. We measure their radial distribution by cor¬ 
relating their positions to those of the BCGs. If a frac¬ 
tion of the clusters is not correctly centred, this also affects 
the observed distribution of satellites. The cluster-satellite 
correlation therefore provides additional constraints on the 
miscentring distribution of BGGs. Note that miscentring 
has a significantly smalle r effect on S than on AS, which 
is illustrated in Fig. 4 in I Johnston et al.l (l2007ll . However, 
the signal-to-noise ratio of the cluster-satellite correlation 
is five to ten times larger than the one from shear, so it still 
provides useful additional constraints. 

We measure the cluster-satellite correlation as follows: 


Ccs(^) — ^rs 


Wra„d(r) TVr )' 


( 11 ) 


with N-cs{r) and Arand(?') the number of red-sequence 
galaxies and random points in a radial bin with a projected 
radial separation r from the BCG. and are the 

total number of red-sequence galaxies and random points, 
respectively. The part between brackets measures the 
overdensity of red-sequence galaxies with respect to their 
average number density, hrs- We therefore multiply it with 
hrs to convert it to the projected density of red-sequence 
galaxies in counts per Mpc^. The signal of the total cluster 
sample is shown in Fig. [S] 

To account for zero-point and PSF variations, we scale 


the number of randoms to the number of red-sequence 
galaxies in each held separately. Secondly, we determine 
the ratio of the total number of red-sequence galaxies and 
the total number of random points in all helds as a function 
of position in the camera, and correct our measurement 
with this ratio. The purpose of this correction is two-fold. 
First, it ensures that the random points have exactly the 
same footprint as the red-sequence galaxies. Secondly, 
it accounts for PSF variations across the camera. In 
particular, the chips in the corners of the mosaic have 
fewer galaxies, as the average PSF is larger than in the 
central chips. 

To model the cluster-satellite correlation, we assume 
that the satellites trace the surface mass dens ity of the dark 
matter, but, motivated by the results from Watsonet.^|y 
(l20Tnl [20 i 3); iBiidzvnski et~lH dMl : iTal et all (l2oTll 
and others, we allow the concentration of the satellites, 
Cgai, to vary. Since we do not know a priori how the 
number of satellites is related to the surface mass density 
of the model, we fit this with a nuisance parameter, 
following ^cs(c) = Agai X E(r), with Agai in units 
and marginalise over it. 


2.3.2. Intrinsic scatter mass-richness relation 


The mass-richness relation has intrinsic scatter; there¬ 
fore, the best-fit lensing mass is not equal to the mean 
mas s of the clusters in a bin. To account for this scat¬ 
ter, iJohnston et al.l (|2007ll integrate their models over the 
probability distribution of halo masses, P{M 2 oo), given by 
a log-normal distribution of M 200 for a given Abnn, and fit 
the variance in ln(M 2 nn) - The results of iBecker et al.l (|2007ll 
an d iEvrard et al.l (12(10811 are used as a prior on the variance, 
which are based on a satellite kinematics study of maxBGG 
clusters and simul ations, resp e ctivel y. 

More recently, iRozo et ahl (|2009ll studied the scatter in 
the mass-richness relation using X-ray measurements of the 
maxBCG clusters, and found = O.dbto/ig (95% 

confi dence limits). This variance is smaller than the one 
from iBecker et ah (|2007 ); the diffe rence was likely caused 
by the fact that Becker et al.1 (|20n7fl did not account for the 
miscentring of clu sters. Therefore, we use the results from 
iRozo et all (l2009ll to account for the scatter. 

Since the scatter cannot be constrained by our data, it 
is important to estimate the prior as accurately as possible. 
The scatter of the mass-richness relation from iRozo et all 
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Table 2. Cluster sample details 


bin 

N 200 

Hcius 

(z) 

(Woo 

) mso 

ClnM|N200 

A/200 

C 200 

b as 

Pc 

Cgal 

^gal 








[lOi^/ifo^A/ol 


[hn 



[x10^/17oA/q^] 


(1) 

(2) 

(3) 

(4) 

(5) 

(6) 

(7) 

(8) 

(9) (10) 

(11) 

(12) 

(13) 


0.20 < 2 < 0.35 













Nlzl 

2.00 - 7.63 

534 

0.30 

5.2 

4.0 

0.57 



9 f;+0-4 

^•0-0.4 

U.ZU_0.02 

0.3912,11 

19.51/0® 

9 q+0.4 
^•^-0.3 

N2zl 

7.63 - 10.47 

345 

0.28 

9.0 

7.5 

0.47 

8.9t“;« 

7.5l^-| 

q 1+0.5 

0 94+0.02 
'-'•^^-0.02 

0.2112:22 

90 0+15.7 

^y.O_g g 

q 9 + 0.3 
'^•^-0.3 

N3zl 

10.47 - 13.65 

268 

0.27 

11.9 

10.4 

0.46 

rj y+0.9 

' •' -0.9 

q q+1.4 
'^•'^-0.9 

9 t::+0.4 
^•^-0.4 

0 9CS+0 03 
U.ZO_Q Q3 

n 4S+0-09 

U.+O_0 09 

12.411:2 

r r|+0.5 
^•^-0.4 

N4zl 

13.65 - 18.14 

216 

0.27 

15.8 

14.3 

0.46 

11.8tlj 


q 1+0.5 

n qq+0.03 

U.OO_0.03 

A 97+0.05 
U.O/-0 05 

19.5I/0® 

4 q+0.5 
^•^-0.4 

N5zl 

18.14 - 26.31 

197 

0.25 

21.8 

21.2 

0.47 

is.stl't 

14.4i®i 

3.8l®-® 

0.3812,21 

0.3012:2! 

10.9l+,2 

5 0+0.5 
^•^-0.4 

N6zl 

26.31 - 220.00 

114 

0.24 

38.0 

36.6 

0.57 

OQ *^+3.2 

0 


9 O+0.9 

0.4212:12 

0.5912:12 

q 7+0.9 
‘ -0.7 

4 fl+0.3 

0.35 < 2 < 0.55 













Nlz2 

2.00 - 7.63 

1648 

0.47 

5.0 

4.0 

0.59 


12.7tli 

4.8l)!1 

0.2012:21 

n 1 7+0.06 
^•-*-^-0.05 

9.311-® 

9 f;+0.2 

Z.D_o 2 

N2z2 

7.63 - 10.47 

607 

0.46 

8.9 

7.7 

0.47 

8.7+1:5 

1 Q y+10.4 
' -4.8 

q r+0.5 
0.O-0.4 

0.2812,22 

U.ZO_0 00 

28.611+ 

4 4+0.5 
^•^-0.4 

N3z2 

10.47 - 13.65 

326 

0.46 

11.9 

10.5 

0.46 


4 1 +2-2 
^•-‘--1.4 

4 - i 1 o :6 

A 97+0.03 
-0.03 

U.OO_0 03 

10.8+2-2 

4 0+0.5 
^•^-0.4 

N4z2 

13.65 - 18.14 

216 

0.47 

15.7 

14.2 

0.46 

n.Qtli 

9.3l®;® 

4.8l°;® 

n Q1 +0.03 
U.Ol_o.o3 

0.3312:21 

10.912:2 

q 7+0.4 
‘ -0.3 

N5z2 

18.14 - 26.31 

126 

0.46 

21.2 

19.5 

0.47 

22.2+^-5 

14.1+“/ 

5 q+0.9 

Q3 

0.3012:2! 

19.611+ 

4 r+0.6 

^•^-0.5 

N6z2 

26.31 - 220.00 

59 

0.46 

36.9 

34.0 

0.56 

41.0t«i 

6.51^:^ 

5.111:2 

0 41+0.05 
U.+1-0.05 

rj 99+0.08 
u.o^_Q 0Y 

11.912:2 

q 0 + 0.5 
'J-O-0 4 

0.55 < « < 0.80 













Nlz3 

2.00 - 7.63 

1170 

0.64 

5.1 

3.3 

0.58 

4.9t°:» 

10.4t// 

6.312-2 

A 9q+0.03 

U.zo_o.o2 

A or + 0.09 
U.OO_o 09 

11.612:1 

3.012:2 

N2z3 

7.63 - 10.47 

670 

0.66 

9.0 

6.6 

0.46 


45.91®®+ 

4 0+O.8 
^"^-0.8 

0.2812,21 

A 99+0.05 
U.^O_0 05 

152.5112+ 

4.811:2 

N3z3 

10.47 - 13.65 

512 

0.67 

11.9 

9.4 

0.46 

11 . 0 t(® 

qc 7+68.0 
00 . < _20.7 

c c:+0.9 
^•^-0.8 

0 . 3012,22 

A 99+0.05 

U.^^_0 05 

1+23.2 

q 7+0.6 
'^•'-0.5 

N4z3 

13.65 - 18.14 

340 

0.67 

15.5 

13.0 

0.46 

i3.7tl:“ 

0 4 + 3.3 

c c + l-S 
^•^-1.0 

0 Q4+O.02 

U.O+_0.02 

u.zy_Q 05 

Cl 7+42.3 
< -19.3 

3 . 912:1 

N5z3 

18.14 - 26.31 

190 

0.67 

21.3 

20.1 

0.47 

18 . 11 ^:? 

10 . 31 / 3 ® 

6 . 411-2 

A OA+0.01 

U.oy_Q 02 

0 ‘:i1 +0.05 
'^•'^-‘--0.05 

135.8111+ 

4 . 411:1 

N6z3 

26.31 - 220.00 

73 

0.68 

36.0 

33.1 

0.55 


4.7l®;® 

6 . 711 :! 

A qq+0.05 
u.oo o og 

n 44+012 
'^•^^-0.10 

7.612:2 

q c+0.9 
'^•'^-0.6 


Notes. (1) richness range of the bin; (2) number of clusters; (3) median redshift; (4) mean richness; (5) mean richness corrected 
for Eddington bias; (6) prior-mean of the scatter in the mass-richness relation; (7) halo mass; (8) concentration; (9) halo bias; (10) 
miscentring width; (11) centring fraction; (12) concentration of satellite distribution; (13) amplitude of cluster-satellite correlation. 


is computed at a fixed richness. Our richness bins 
span a considerable range, which broadens the actual dis¬ 
tribution of halo masses. To obtain a more representative 
value for the scatter, we take the following approach. We 
use a mass-richness relation to predict the mass of each 
cluster in a particular richness bin, and adopt that as the 
mean of a log-normal probability distribution with a con¬ 
stant = 0.45. We then combine the probability 

distributions of all clusters in that richness bin, and fit a 
log-normal distribution to it. We adopt the best-fit width as 
the scatter and list it in Tabled We se t the prior width t o 
0 .2, which is the error on (TinM|N 2 oo from iRoz o et al. (l2009tl 
W e use the mass-richness relation from l.Tohnston et al.l 
(l200l . M 200 = 8.8 X 10i3(7V2oo/20)i'28/i-iMo. In princi¬ 
ple, we could follow an iterative approach where we update 
the mass-richness relation with our findings, but given that 
the relation we derive is not very different, and given that 
the derived scatter is fairly insensitive on the adopted rela¬ 
tion, we regard this as unnecessary. 


2.3.3. Model fitting 

To efficiently sample parameter space and fit the mod- 
els in a reasonable amount of time, we use Emcee 
(|Foreman-Mackev et al.l 1201, 3^ , the publicly available 
ensemble Markov Chain Monte Carlo (MCMC) sampler. 
The free parameters in this model are M 200 , C 200 j b, as, q, 


o'inM|N 2 oo> '-gal AgaP For Convenience, we summarise 
the priors in Table [TJ Note that for all but one parameter, 
(TinM|N 2 oo> priors are uninformative, hence the results 
do not depend on them. We assess how the best-fit masses 
depend on the prior of crinM|N 2 oo results section. 

We run Emcee with 200 walkers, starting near the 
best-fit model of each sample. The number of steps of 
each walker is set to 3 000. We conservatively discard the 
first 500 steps as the burn-in phase. The resulting 500 000 
model evaluations are used to estimate the parameter 
uncertainties. The fit parameters and their errors which we 
report in the following sections correspond to the median 
and the 68% confidence intervals of the marginalised 
posterior distributions. 

To fit the models to the data, we need to estimate 
the covariance between the data points. We do this by 
measuring the signal in each of our 145 2x2 degree patches, 
from which we create a large number of bootstrap reali¬ 
sations. The covariance matrices are estimated from the 
different realisations. We show a representative normalised 
covariance matrix for one of our cluster samples in Fig. 
m This figure shows that the lensing measurements are 
not correlated. We therefore set the off-diagonal elements 
of the lensing signal, and the correlation of the lensing 
signal with the cluster-satellite correlation, to zero. The 
cluster-satellite correlation itself, however, is correlated at 
large scales, hence we keep those off-diagonal terms. The 
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[ 2 . 00 , 7 . 63 ] [ 7 . 63 , 10 . 47 ] [ 10 . 47 , 13 . 65 ] [ 13 . 65 , 18 . 14 ] [ 18 . 14 , 26 . 31 ] [ 26 . 31 , 150 ] 



0.1 1.0 0.1 1.0 0.1 1.0 0.1 1.0 0.1 1.0 0.1 1.0 

Projected separation [/ij'jMpc] 


Fig. 7. Lensing signal AE as a function of projected separation from the BCG for the different cluster samples, split in 
richness (as indicated on top of each column) and redshift (indicated in each panel). The solid black lines indicate the 
best-ht cluster halo model, simultaneously fitted to the lensing signal and the corresponding cluster-satellite correlation 
signal in the range 0.15 < r < 5 h^Q Mpc. The dotted-dotted-dashed grey line shows the contribution from the BCG, the 
dotted red line the contribution from correctly centred clusters, the dashed green line the contribution from miscentred 
clusters, and the dotted-dashed blue line the contribution from neighbouring haloes. The dark red, orange and yellow 
shaded areas correspond to the 1-, 2- and 3 -(t model uncertainty. 


inverse of the resulting covariance matrix is used in the fit. 
To correct for the bias which is introduced when a noisy 
covariance matrix is inverted (jHartlao et al.l l2007tl . we 
multiply the inverse covariance matrix with a correction 
factor (Afgample A^data 2)/(Asample 1)5 where Afgample Is 
the number of independent samples used in the bootstrap, 
in our case 145, and iVdata is the number of data points, 
22 . 


2.4. Fit results 

We divide the clusters in bins of richness and redshift, 
as detailed in Table [2] Although inherently somewhat 
arbitrary, these ranges were chosen such that they enable 
us to reliably measure and study potential trends with 


richness and redshift. The stacked lensing signals are 
shown in Fig. [7] and the cluster-satellite correlations in 
Fig. [5J together with the best-fit halo models. The errors 
on the measurements correspond to the square root of the 
diagonal of the covariance matrix and indicate the 68% 
confidence intervals. The trends in the data, such as the 
kink at ~1 Mpc due to the miscentring of clusters, 
are well reproduced by the model. We find an average 
reduced chi-squared value of Xred = (with 14 d.o.f. per 
bin), suggesting that the data is well modelled. When we 
average the residuals of the ht for all bins, we find that 
the models underpredict the cluster-satellite correlation at 
scales 1-2 h^p^Mpc, exactly the regime that is difficult to 
model because of the overlap between the one-halo and 
two-halo term. 

To illustrate how well the model parameters are 
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[ 2 . 00 , 7 . 63 ] [ 7 . 63 , 10 . 47 ] [ 10 . 47 , 13 . 65 ] [ 13 . 65 , 18 . 14 ] [ 18 . 14 , 26 . 31 ] [ 26 . 31 , 150 ] 



Fig. 8. Cluster-satellite correlation signal, measured using red-sequence candidates brighter than M* -|- 1, as a function 
of projected separation from the BCG, for the different cluster samples, split in richness (as indicated on top of each 
column) and redshift (indicated in each panel). The solid black lines indicate the best-fit cluster halo models, obtained 
from simultaneous fits to the lensing signal and the cluster-satellite correlation signal in the range 0.15 < r < 5 
Mpc. The dotted red line shows the contribution from correctly centred clusters, the dashed green line the contribution 
from miscentred clusters, and the dotted-dashed blue line the contribution from neighbouring haloes. Note that the 
measurements are correlated at large scale (see Fig. [3, which is accounted for in the fit. The dark red, orange and yellow 
shaded areas correspond to the 1-, 2- and S-cr model uncertainty. 


constrained, we show the marginalised posteriors of the 
fit parameters for the N4zl bin in Fig. [HI together with 
the priors that were used in the ht. Only crinM|N 2 oo cannot 
be constrained by the data. For the other parameters, 
the choice of the prior is not important as they are well 
constrained. A wrong choice for the prior of o'inM|N 2 ooj 
however, could bias our results if degeneracies exist. To 
investigate this, we show the two-dimensional marginalised 
posteriors of all pairs of parameters in Fig. (TUI crinM|N 2 oo 
is only somewhat degenerate with M 2 Q 0 , but not with 
the other parameters. To assess the sensitivity of our 
results to the chosen priors, and to enable a more detailed 
comparison to the results of iJohnston et all (|2007f) . we also 
fit our cluster halo models but only to the she a r, ado pting 
the priors that were used in I.Tohnston et al.l (|2007fl . We 


discuss how that affects our results in the following section. 


3. Mass-richness relation 

In order to enable a comparison of our mass-richness rela¬ 
tion to simulations, we have to account for Eddington bias: 
the observational scatter which causes clusters to preferen¬ 
tially move from richness ranges where the abundance of 
clusters is high to where it is low. This is a separate ef¬ 
fect from intrinsic scatter, which defines the width of the 
halo mass distribution at a given richness if both quanti¬ 
ties could be measured with infinite precision, which we 
account for in our halo model. The observational scatter 
is mainly caused by the field-to-held variance of the back- 
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Fig. 9. Posteriors of the fitted cluster halo model parameters for the N4zl bin, marginalised over all other parameters. 
Black solid lines indicate the posterior, red lines the prior. The dashed vertical line indicates the median of the marginalised 
posterior, the blue shaded area is the 68% confidence interval around the median, and the dotted vertical line indicates 
the location of the best-fit value. Dimensions as in Tabled 


ground number density of red-sequence galaxies, for which 
we use a global estimate. We correct N 200 for Eddington 
bias using Bayes theorem. The probability distribution of 
the underlying N 200 given an observed value iV 2 Qg (the pos¬ 
terior) is proportional to the product of the chance of hav¬ 
ing a value of N 20 Q given a distribution of fV 2 oo (likelihood) 
and the probability distribution of A ^200 (prior): 

p(7V2oo|iV2°oo) ^p{N^^^\N 2 oo)p{N 2 oo). (12) 

The likelihood is determined by the measurement uncer¬ 
tainty of iV 2 oo, which our cluster finder provides. The er¬ 
rors are ~20% larger than Poisson, but we adopt a Poisson 
distribution as a reasonable first order approximation for 
the likelihood distribution. For the prior, we could in prin¬ 
ciple use the observed richness distribution. However, as 
shown in Fig. [I] the cluster sample is incomplete at the 
low richness end by an uncertain amount, and using it as 
a prior would lead to an erroneous correction. Since we ex¬ 
pect that the cluster sample is complete for approximately 
N 200 > 15j "we fit a power law to the richness distribution 
at 20 < N 2 Q 0 < 40. For the prior, we replace the observed 
richness distribution with this power law at N 200 < 20, 
whilst at larger richnesses we use the observed richness dis¬ 
tribution. We sum the posteriors of all clusters in a bin, 
normalise it and integrate up to the mean, These 

values are tabulated in Table O as well as the uncorrected 
values. Only the corrected richnesses are used in the follow¬ 
ing, unless explicitly mentioned otherwise. 

We show the mass-richness relation in Fig. [TT] Note 
that the mass we show is the mean of the log-normal 


Table 3. Power-law parameters of the fit between N 200 and 

M200- 


Bin 

A 

a 

0.20<z<0.35 

15.0 ±0.8 

0.73 ±0.07 

0.35<z<0.55 

22.5 ± 1.7 

0.86 ±0.08 

0.55<2<0.80 

18.7 ±2.2 

0.78 ±0.11 

maxBCG 

18.3 ±1.1 

0.98 ±0.07 

(3<A2oo<50) 




distribution of halo masses we integrate over in our 
halo model. We fit a power-law relation of the form 
M 200 = ^(-^ 200 * 720 )“ in each redshift slice and report the 
best-fit slopes and normalisations in Table [31 The errors 
on the amplitude are determined by marginalising over the 
slope, and vice versa. The likelihood contours of the fit 
are shown in Fig. |T3] Without the correction for Eddington 
bias, we would have obtained A = (14.4±0.7) x Mq 

and a = 0.83 ± 0.08 for the 0.20<z< 0.35 bin, which devi¬ 
ates by approximately Icr. 

A number of complications limit a simple interpreta¬ 
tion of the weak leasing mass estimates of clusters, such 
as intrinsic profile variations of dark matter haloes (e.g . 
IClowe et al.ll2004 ICorless fc Kindl2007t iGruen et al.ll2015 ) 
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Fig. 10. Posteriors of the fitted cluster halo model parameters for the N4zl bin, for all pairs of parameters. Shown are 
the Icr, 2a and 3 ct confidence regions. The best-fit values are indicated by the green open diamonds. These plots illustrate 
the degeneracies that exist between the fit parameters. Dimensions as in Tabled 


and the presence of correlat ed and uncorrelated structure 
along the line-of-sight (e.g . iMetzler et al.l l200lL iHoekstr^ 
l2001t iHoekstra et al.l 1201111 . These complications mainly 
increase the scatter of the mass estimates, but may even 
lead to s mall (^5-10%) biases if model fitting techn iques 
are used (iBecker &: Kravtso^l2011l : iRasia etlih l2012ll . The 
lensing signal can be modelled i n various ways, and partic - 
ular choices can reduce this bias (|Mandelbaum et al.ll2010ll . 
More detailed numerical simulations are required to quan¬ 
tify this bias more precisely, e.g. as a function of mass and 
redshift, to interpret the results correctly. This is impor¬ 
tant for the exploitation of clusters as a reliable tool for 
cosmology. 


3.1. Comparison to previous single-redshift results 


We compare our results to the weak lensing an alysis of 
the maxBCG cluster sample (jKoester et ahll^OTf) . a cata¬ 
logue of 13 823 clusters that has been detecte d in the SPSS. 
The c luster detection algorithm employed in iKoester et al.l 
(|2007ll identifies the cluster red-sequence galaxies, and se¬ 
lects the brightest, the BCG, as centre of the cluster. The 
resulting cluster sample covers the richness range 10 < 
N^nn < 190 a r id a redshift range of 0.1 < z < 0.3. In 
LSheldon et al.l (l2009ll . the cluster sample is extended to 
N 200 = 3, which leads to a sample of ^130 000 galaxy 
groups and clus ters. The lens i ng ana lysis of the sample 
is presented in LSheldon et al.l (l2009ll: the in ass-richness 
relation is derived in l.lohnston et al.l (|2007ll . Following 


Mandelbaum et al 

■Tohnston et al.l (|2007ll 


(|2008bll . we multiply the masses from 
by a factor 1.18 to account for 


the impact of photometri c redshift sc a tter i n the lensing 
analysis (as was don e in iRozo et"^ (|2010lB . Note that 
I Johnston et al.l (1200711 use the same definition of mass as 
we do. 

The richnesses of iJohnston et all (|2007ll have not been 
corrected for Eddington bias. We perform the correction, 
using a probability distribution for the maxBCG sample of 
pi N^nn) PC (iV 2 nn)~^ over th e entire richness range, follow¬ 
ing lAndreon fc Huriil (|2010ll , which is in good agreement 
with a slope of —3.06 that we find for our clusters with 
0.2 < 2 < 0.35 and 7V2oo>15. We adopt the mean rich¬ 
ness as iV 200 to compute the posterior, instead of stack¬ 
ing the posteriors of the individual clusters. This makes a 
negligible difference. We show the corrected results in Fig. 
m We fit the same power law in the overlapping richness 
range, 3 < N^qq’^ < 50, and list the best-fit parameters in 
Table [31 The amplitude and slope of our low-redshift sam¬ 
ple are about Scr lower than those of the maxBCG. Figure 
HD shows that this discrepancy is partly driven by the bins 
at A^2 oo<10 . If we fit the relation at A^200>10, the slopes 
are consistent but the amplitudes still differ by ~2.5cr. 

There are several differences between the analyses. For 
instance. Fig. H] suggests that our richnesses are somewhat 
larger than those of the maxcBCG for rich systems; how 
they differ at low N 200 is unclear as the public maxBGG cat¬ 
alogue only includes iV2oo>10 clusters. If our richnesses are 
systematically larger than maxBCG for richer systems, this 
would tend to lower our normalisation and slope, which may 
partly explain the discrepancy. The number of matching 
RCS2 and maxBCG clusters is too low to assess this quan¬ 
titatively. Also the purity of the two cluster samples may 
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radial bin 

Fig. 6. Normalised covariance matrix between the radial 
bins of the leasing measurement and the cluster-satellite 
correlation for the clusters in the N4zl sample (the fourth 
richness bin of the first redshift slice, see Table [2]). The first 
11 bins are the radial bins of the leasing measurements be¬ 
tween 0.15 < r < 5 h^Q Mpc, the second set of 11 bins are 
the radial bins of the cluster-satellite correlation between 
0.15 < r < 5 Mpc. 


differ, as different clus ter detection algorit hms have been 
used on different data. iKoester et al.l (|2007ll show how the 
purity of the sample depends on particular settings of the 
maxBCG algorithm using mock catalogues. For maxBCG, 
the purity is typically of the order 90% or higher at rich¬ 
nesses N 200 > 10 ; how the purity varies at lower richnesses 
is not reported. Since the purity of the RGS2 cluster sample 
is expected to be high as well, it seems unlikely that differ¬ 
ences in the purity could lead to differences larger than a 
few percent in the masses. 

Also on the modelling side, there are noticeable dif¬ 
ferences. For example, we use different priors for the mis- 
centring distributi on and for the scatter between richness 
and halo mass. In I Johnston et aP l|2007r) . the miscentring 
priors are based on numerical simulations, from which a 
functional form is derived for q, with a corresponding pc 
(the fraction of clusters that is correctly centred) ranging 
from 60% to 80% for richnesses in the range of N 200 = 10 
to ^”200 = 100. The width of the miscentring distribu¬ 
tion is fixed at 0.42h“^Mpc. Our best-fit parameter sug¬ 
gest a Pc in the range of 20-50%, and a narrower width 
(Ts « 0.2-0.4 h^Q Mpc. It is conceivable that the miscen¬ 
tring distribution differs for the two catalogues, as different 
algorithms have been applied to different data to identify 
the BGG. However, if the actual miscentring distributions 
are similar, the different priors could cause discrepancies 
between the best-fit masses and concentrations due to the 
parameter degeneracies. The same holds for the adopted 
prior for the scatter in mass-richness. Motivated by this, we 
run our cluster ha lo models on the leas ing data only, us¬ 
ing the priors from l.lohnston et all (|2007fl . Note that we do 
not account for the differences between the two-halo terms. 
Nevertheless, this test will give us a reasonable impression 



Fig.11. Gluster mass versus richness for the cluster sam¬ 
ples as indicated in the plot. The red circles correspond the 
results from the lensing analysis of the maxBGG clusters of 
I Johnston et al.l l|2007f) . which have been boosted by a factor 
1.18 to account for the impact of photometric redshift scat¬ 
ter (see text). All the measurements that are shown here 
have been corrected for Eddington bias. 


how sensitive our results are to the adopted priors. 

The resulting best-fit masses of the individual bins are 
consistently lower but within the Icr error bars of the nomi¬ 
nal results. When we refit the power-law relation, we find an 
amplitude and slope of A = ( 12.2 ± 1 . 2 ) x lO^^h^g^M 0 and 
a = 0.69 ±0.15 for the 0.20<z<0.35 redshift slice, which is 
even more discrepant. Gomparing this to our nominal re¬ 
sults shows that the tighter constraints on the miscentring 
distribution, obtained from including the cluster-satellite 
correlation in the fit, also leads to smaller errors on the 
mass. As the amplitude is ~2(T lower than our nominal re¬ 
sult and the slope is consistent, our results do not critically 
depend on the adopted priors. Furthermore, we have also 
checked that by changing the implementation of the two- 
halo term, the masses do not change significantly. 

The mass-richness relation has also been derived from 
semi-analytic galaxy forma tion models base d on N -body 
simulations. For example, iHilbert fc White! (|201flll used 
the Millennium Simulation (|SDringell 200,^ and found 
that the derived mass-richness relation agreed well in 
shape and ampl it ude w ith the relation from maxBGG. 
iHilbert fc Whitel (l201(lll report an amplitude and slope 
of A = (17.7 ±0.1) X and a = 1.09 ± 0.01. 

The amplitude and slope are a bit higher than our low- 
redshift results. The difference is again mainly driven by the 
A^ 2 oo <10 results, but additionally, some diffe rence may be 
caused by the different definitions of richness. I Angulo et al.l 
(| 2012 ll measure the relation between optical richness and 
mass at z=0.25 in the Millennium-XXL simulation, which 
extends the Millennium and Millennium- 11 Simulations 
(ISoringell 120051: iBovlan-Kolchin et al.ll200^ . A power-law 
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Fig. 12. 67.8%, 95.4% and 99.7% confidence limits of the 
fits to the mass-richn ess relation. The black dotted lines 
show the results from iJohnston et al.l (|2007fl in the over¬ 
lapping richness range. 


slope of 1.07 is reported, steeper than our results, but again 
the difference may originate from the low-richness end and 
from differences between the richness estimators. 

In iFord et al.l (|2015ll . a sample of 18 000 optically se¬ 
lected clusters at 0.2<z<0.9 are studied in the Canada- 
France-Hawai Telesc ope Lensing Survey (CFHTLenS; 
iHevmans et al.l l2012ll . Clusters are identified u si ng th e 
3D-Matched-Filter finder of iMilkeraitis et al.l (l2010ll . 
Richnesses are defined as all cluster members within r2oo 
that are brighter than an absolut e f-ba nd magnitude of 
— 19.35, as detailed in iFord et all (l2014ll . It is not clear 
how that relates to our richness estimate. Using the same 
parametrisation of the mass-richness relation as we do, 
they Hnd an amplitude and slope of 2.7lo.4 x IO^^Mq and 
1.4±0.1, respectively. No correction for the Eddington bias 
is performed, leading to a larger slope. However, the nor¬ 
malisation is significantly lower than what we find. The 
difference may be partly attributed to the definiti o n of r ich- 
ness (see the discussion in Sect 5.3 of iFord et al.l (|2015ll '). 

3.2. Redshift evolution 

At any given richness, clusters at 0.35<z<0.55 have larger 
masses than those at 0.2<z<0.35. Whether this trend con¬ 
tinues towards higher redshifts is not clear given the large 
errors on the mass for the 0.55<2<0.8 clusters, although 
our results suggest that it does not. 

Both the halo masses and the richnesses of clusters may 
evolve. The bulk of the change in halo mass is not ex¬ 
pected to be phys ical accretion, but r ather an effect called 
pseudo evolution (|Diemer et al.ll20I^ . Halo masses are de¬ 
fined with respect to a background density (usually the 
mean or the critical density), which evolves with redshift. 


Even if a halo does not accrete anything, its mass increases 
with time as the background density drops. If the richness 
of a cluster would not change, we would expect to see an 
increase in halo mass towards lower redshifts. The fact that 
we find the opposite trend and that the halo mass decreases 
at a given richness, points towards an evolution of the rich¬ 
ness. 

Clusters build up their red sequence over cosmic time. 
The cluster galaxies are stripped of their gas through 
tidal interactions and ram pr essure stripping, whic h 
quenches their star formation (e.g. lBoselli fc Gavazzil[200^ . 
Consequently, the late-type spiral galaxies that are accreted 
turn into early-type SO galaxies, and subsequently appear 
on the E/SO ridge line. Hence even without accreting new 
galaxies, A^200i the red-sequence richness, increases as more 
galaxies turn red. Various works have reported an increase 
of the number density of fai nt red-sequence c l uster members 
towa r d low redshift (e.g. |Loh_et_^l| 


2008; iRudnick et al. 120091: Jaffe et al 


2008t iGilbank et al 


201 It IVulcani et al 


2011 1. For example, Rudnick et al. 1(200911 measure the op¬ 


tical cluster luminosity function of red-sequence galaxies at 
2<0.8. They find that at magnitudes brighter than M*, 
the luminosity function does not evolve much, suggesting 
that these cluster members are already in place. However, 
at fainter magnitudes, the luminosity function strongly in¬ 
creases towards lower redshift. This supports the view that 
cluster richnesses (defined with a M* + 1 magnitude limit, 
as we do) may become larger towards lower redshift, in line 
with what our results suggest. There may be other pro¬ 
cesses that could cause an evolution of ^200- Mass segrega¬ 
tion could lead to more bright cluster members within the 
inner one Mpc, which would boost A"gai, leading to higher 
values for rfgQ and Mergers of cluster members both 

brighter than M* -|- I would lower V200, but mergers of 
faint red-sequence members could increase it. The build-up 
of the red sequence, however, is likely the dominant effect. 

The redshift dep endence of the mass -richness relation 
was also measured in ISheldon et al.l (|2009ll for the maxBCG 
clusters, but due to the limited redshift range of that sam¬ 
ple no change with redshift was found. However, in a study 
of the relation betwee n X-ray luminosity and richness for 
the maxBCG clusters, iRvkoff et al.l (1200811 found that the 
X-ray luminosity at 2 = 0.2 8 is twice as high a s the X- 
ray luminosity at 2 = 0.14. iBecker et al.l (|2007^ studied 
the relation between velocity dispersion and richness for 
the same clusters, and found that the clusters at high red- 
shifts systematically hav e higher veloc i ty dis persions. Both 
IBecker et al.l (1200711 and iRvkoff et al.1 (l2008ll expected the 
main cause to be the evolution of the V200 richness mea¬ 
sure, implying a fractional decrease in V200 of 30%-40% 
from 2 = 0.14 to 2 = 0.28. We ignored this effect when we 
compared our results in Fig. 1111 but since the mean redshift 
of maxBGG clusters is 0.25, very close to the mean redshift 
of our low-redshift clusters, it is not important. If we ignore 
the potential evolution in halo mass, our results suggest a 
fractional decrease of 34 ± 4% in ^”200 from 2 = 0.27 to 
2 = 0.46. _ 

lAndreon fc GongdonI (|2014il measure the richness and 
mass for a sample of 23 very massive clusters with 
0.15<2<0.55 within a hxed aperture of 0.5 Mpc. This 
‘aperture’ mass-richness relation does not evolve with red¬ 
shift. Most of the accretion and quenching of new cluster 
members happens at the outskirts of galaxy clusters (e.g. 
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Fig. 13. Cluster mass versus the width of the miscentring 
distribution (top), versus the width divided by r 2 oo (middle) 
and versus the fraction of ‘correctly centred’ BCGs, the 
ones that are located at the centre of the halo (bottom). 
The black dashed lines show the fitted relation between 
miscentring parameter and mass, as described in the text. 


Ivan der Burg et aI1l2015H . and may be missed when using 
an 0.5 Mpc aperture. Furthermore, the selection of red clus¬ 
ter galaxies and the computation of richness differ from our 
work, which may contribute to the apparent discrepancy 
between the results. 


4. Miscentring distribution 

By htting the shear measurements together with the 
cluster-satellite correlation, we obtain, for the first time, 
tight constraints on the miscentring distribution of BCGs 
with respect to the centre of their dark matter haloes. 
Figure [T2] shows that ~30% of our BCGs are located at 
the centre of the halo. The distribution of the miscentred 
BCGs is described by a 2D-Gaussian with a standard 
deviation that increases from ~0.2 to ^0.4 ^70 Mpc 
going from our poorest to our richest clusters; the ratio 
of width and r 2 oo is flat with mass. Since the miscen¬ 
tring parameters do not appear to evolve with redshift, 
we combine all our cluster samples and parametrise 
the relations as Pc = Ap + Bp x (logiQ(M2oo) — 14), 
obtaining Ap = 0.29 ± 0.02 and = 0.16 ± 0.07, 

and as = Act + X (logio(M2oo) - 14), finding 
= 0.29 ± 0.01 [/ito Mpc] and B^ = 0.31 ± 0.03 

[/i7Q^Mpc/logio(h7Q^M0)]. Since cluster members are 
spread over a larger volume in more massive clusters, we 
expect that the mass dependency is reduced when we 
consider the ratio (Js/'r 2 m- Indeed we find this ratio is 
consistent with a constant, for which we obtain a value 
of 0.44 ± 0.01 for all samples combined. The average 
for the low-, intermediate- and high-redshift slices are 
0.42 ± 0.02, 0.40 ± 0.02 and 0.51 ± 0.02, respectively. Note 


that we account for the errors on both the dependent 
and independent parameters in the fit. The high-redshift 
slice has an intrinsically broader miscentring distribution, 
which may reflect that more clusters are still undergoing 
mergers at that redshift, or that the BGG is more often 
misidentified due to increased photometric errors. This 
mi scentring distrib u tion is different from the one adopted 
in I Johnston et al.l l|2007f) . which was based on mock 
catalogues determined from simulations that were fit to 
a range of observations. There, a much higher fraction of 
60-80% of the BGGs was found to be correctly centred. 
The width of the miscentring distribution was found to 
be 0.42 /i“^Mpc, larger than what we hnd. The lensing 
signal itself cannot discriminate between their miscentring 
distribution and ours, but the cluster-satellite correlation 
is able to break the degeneracy. 

It is possible that the difference between the miscen¬ 
tring distributions is the result of the different algorithms 
used to identify the BGG. This could be tested by applying 
both algorithms to the same set of simulations or data. 
However, part of the discrep ancy could also be du e to 
how the simulations used in iJohnston et al.l (120071) are 
constructed. In these simulations, every dark matter halo is 
assumed to have a BGG at its centre. Hence misidentifying 
the BGG is the only reason why the centring fraction 
is less than 100%. However, in reality, the central BGG 
may actually be displaced by a small amount from the 
centre of the halo. This would push the centring fraction 
down and lower the width of the miscentring distribution, 
more in line with our findings. Also, some BGGs may be 
star forming, leading t o too blue colours to be selected as 
red-sequence member (|Bildfell et al.ll200^ . 

The location of BG Gs in cluster s has been studied 
in various other works. ISkibba et al.l (l201l!) study mock 
catalogues and SDSS group catalogues and report that 
in 40% of their groups with a mass larger than 5x10^^ 
the brightest galaxy is a satellite gal axy instead of 
the ce ntral galaxy. Using N-body simulations. iMartel et al.l 
(|2014[) find that the fraction where the brightest galaxy 
is not the nearest to the centre increases from ^^25% 
to ~50%, w ith a higher miscen tring fraction towards 
higher mass. iHoshino et al.l (l2015li stu dy the distribution 
of LRGs in the redMaPPer clusters (iRykoff et al.l l2014fl 
and find that 20-30% of the brightest LRGs are not the 
central galaxy. As the central galaxy might be somewhat 
offset from the peak of the dark matter, these results 
might be in a g reeme nt wit h our results. Remar kably, both 
ISkibba et al.l (|201lli and iMartel et al.l (|2014ll find that 


the centring fraction actually decreases with increasing 
mass, in contrast to what is assumed in Johnston et al.l 
(|2007l) . This is attributed in IMartel et al.l (^2014^ to cluster 
mergers, which haye occurred most recently in more 
massiye clusters as they are the last ones to form. Our 
ce ntring fraction s do n ot show a clear trend with mass. 

iGeorge et al.l (l2012ll study the miscentring distribution 
in 129 X-ray-selected galaxy groups using their stacked 
weak-lensing signal, for different group centre definitions. 
In their model, all centres follow a 2D-Gaussian distri¬ 
bution, as their data does not require a correctly-centred 
component. Adopting the brightest group galaxy within 
7200 as the group centre, they report as = 24.8 ± 12.0 
kpc, significantly smaller than the typical yalues for cr^ 
that we find. Since the groups were X-ray selected, the 
sample may contain relatiyely more relaxed systems, whose 
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Fig. 14. Cluster mass versus concentration for the differ¬ 
ent redshift sli ces. The lines show th e mass-concentration 
relation s from Johnstonetalj (20071. Mandelb aum et id] 
(I2008all. iNeto et all (|2007ir buffv et all (1200811 and 


button &: Macciol (|2014lh as indicated in the figure. 


BGGs could be closer to the centre of the dark matter 
dis tribution than for the full population of groups. 

IZitrin et al.l (|2012ll studied the miscentring distribution 
of BGGs in 10 000 SDSS clusters, under the assumption 
that light traces mass. Also in that work, a much narrower 
miscentring distribution is reported, with a typical width 
of 15 kpc. Since the BCG is usually by far the brightest 
galaxy in a c luster , this result may not be that surprising. 
IZitrin et al.l (l2012ll do not characterise the offset distribu¬ 
tion of the misidentified clusters, which constitutes about 
10% of th e sample. Par t of th e discrepancy b etwee n the 
results of iGeorge et al.l (|2012ll : IZitriii et al.l (l2012ll and 
our findings may be caused by the fact that we include 
a correctly-centred component, as our data require it. 
Forcing all our BGGs to follow a 2D-Gaussian would lower 

CTs- 


5. Mass-concentration relation 


Figure [TJ] shows the relation between mass and concen¬ 
tration, together with a number of literature results: 
the mass-concentrati o n rela tion of the maxBGG clusters 
from I Johnston et al.l (l2007f). at a mean re dshift of 0.25; 
the results from iMandelbaum et al.l (|2008all . who derived 
the mass-concentration relation by combining lensing 
measurements for L=i=-type galaxies, galaxy groups traced 
by LRGs and the maxB GG sample, f or a m ean redshift of 
z = 0.22; the relation of iNeto et al.l (l2007ll . derived using 
the Millenn ium Simulation at z = 0; the relation from 
iDuffv et al.l 


at z = 0.46, based o n large N-body 
simul ations using the WMAP5 cosmolo gy (iKomatsu et ah 
l2009li: and, finally, the relation from iDutton fc Maccib 
(|2014li at z = 0.46, derived from N-body simulations using 


the Planck cosmology (jPlanck Gollaboration et ^l2014^ . 

The concentrations for our low-redshift slice agree 
well with the literature results. For exa mple, if we fit 
the mass-concentration r el ation from I, Johnston et al.l 
(|2007D (|Mandelbaum et ^ (|2008alB to our results, we 
find a relative normalisation of 1.16 ± 0.16 (1.03 ± 0.14), 
consistent with unity. The concentrations for our clus¬ 
ters in the 0.35<z<0.55 range are higher. If we fit the 
mass-concent r ation relation from iDutton &: Macci^ (|2014f) 
(jPuffy et al.l (|2008li J at z = 0.46, we find a relative 
normalisation of 1.78 ± 0.31 (2.28 ± 0.40), 2-3 ct larger 
than unity. For the highest redshift slice, the relative 
normalisation is 1. 67 ± 0.54 (2.12 ± 0. 69) w it h respect to 
the re lation from IDutton fc MaccicJ (|2014f) (|Duffv et al.l 
(|2008lB at z = 0.67. Note that we have ignored the error 
on the mass in deriving the relative normalisations, which 
should not matter much as the concentration changes only 
very weakly with mass. 

To investigate how sensitive our results are to the cho¬ 
sen prior, and to simultaneously fitting the cluster-satellite 


correlation, we look at how our results chan ge when we only 
fit the lensing signal, using the priors from I, Johnston et al.l 
(I2OO7T) . The resulting relative normalisation of the mass- 
concentration relation of the 0.20<z<0.35 redshift slice 
becomes 0.85 ± 0.08 (0.76 ifc 0.07) w ith respect to the 
relation from iJohnston et al.l (1200711 (jMandelbaum et "ni 
(1200^ 1. This is about 2cr lower than our nominal result 
and therefore not significant. Figure [10] shows that q and 
C200 are anti-correlated, such that forcing q to high values 
(lowering the miscentring fraction) pushes C200 down, in 
line with our results. Note that t he constraints become 
tighter when we use the priors from IJohnston et al.l (|2007f) 
because of the use of informative priors on q and erg. 
Finally, we have also checked that our results are robust 
against moderate changes in modelling the two-halo term. 
Only if we apply a large boost to the 2-halo term, for 
example by using the non-linear power spectrum instead 
of the linear one, the best-fit concentrations become 
significantly higher. 

Our intermediate- and high-redshift clusters have a 
concentration that is higher than what is expected from 
dark-matter-only simulations. The question of whether 
clusters are over-concentrated with respect to simulations 


Umetsu et al. 20111 

King- & MeadI 2011: Oguri et al.l 20121: 

Auerer et al.l 120131: 

Foex et al.l I20l4 lUmetsu et al.l l2014l1. 


Most of these works focus on strong lensing systems, which 
are expected to have higher projected conc entrations than 
typical clusters of similar mass and redshift (iHennawi et al.l 
I2OO7I : iQguri et al.l 12009112012D . On the other hand, weak- 
lensing studies of optically selected low-redshift galaxy 
clusters, as well as our 0.20<z<0.35 results, do not show 
a strong deviation from the mean mass-concentration 
relation predicted in AGDM cosmologies. 

Any red-sequence cluster finder that applies a spatial 
filtering, like ours, will have some preference for selecting 
over-concentrated clusters, that is, structures elongated 
along the line of sight. To check whether it is possible that 
our results are affected by selection effects, we estimate 
the completeness. Jf our sample is complete, we cannot 
systematically miss the under-concentrated clusters. We 
compute the cumulative halo mass function at the average 
redshift of each of the three redshift slices and multiply 
that with the volume in that slice. This gives us a crude 
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Table 4. Fit parameters of the relation between Agai and 
-^ 200 - 


Bin 

B 

ft 



[xlO^] 

0.20<2<0.35 

4.1 ±0.2 

2.2 ±0.6 

0.35<2<0.55 

3.3 ±0.1 

2.1 ±0.5 

0.55<2<0.80 

3.8 ±0.3 

1.0 ± 1.0 


estimate of the expected number of clusters above a 
given mass in the RCS2. Next, we use our mass-richness 
relation to assign each cluster a mass and compute the 
observed cumulative cluster mass function. Comparing 
this to the theoretical prediction gives us an estimate of 
the completeness. For our lowest redshift slice, our cluster 
sample is roughly complete at M2Q0 > for our 

intermediate-redshift clusters, we find a completeness of 
70-80% in the same mass range; and for the high-redshift 
clusters, the completeness is less than 50%. Note that these 
are crude estimates, but they show that it is possible that 
our intermediate- and high-redshift clusters are affected by 
selection effects. 

Another effect that might affect the concentrations are 
projections of pairs of clusters. Large-scale structure may 
be present between the clusters, which boosts the projected 
density at small scales. Additionally, the two parts of the 
projections will have lower masses and therefore on average 
higher concentrations, as the concentration is a decreasing 
function with mass, although this effect is small as the 
mass-concentration relation is fairly flat. If the projected 
separation of the two parts is small, both effects would 
increase the concentration. Larger projected separations 
are more likely, however, and would tend to lower the 
concentrations. Note that we find further evidence for 
the presence of selection and projection effects from the 
constraints on the halo bias, which we discuss in the 
following section. The bias on the concentration caused 
by triax iality or halo sub structure is expected to be much 
smaller (iBahe et al.ll2012ll . 


6. Satellite distribution 

We show the parameters that describe the satellite distribu¬ 
tion in Fig. [15] Cgai is the most sensitive parameter to how 
we model the two-halo term. Given the large uncertainties 
in modelling the one-to-two-halo transition regime, the re¬ 
sults need to be interpreted with great care. The reason why 
Cgai is so sensitive, can be seen as follows. Different imple¬ 
mentations of the two-halo term mainly change the signal 
at small scales, but not the large (linear) scales. When we 
for example increase the signal from the two-halo term at 
small scales, where there is overlap with the miscentred 
term, the one-halo profile needs to become steeper, which 
is done by forcing Cgai upwards. 

Since a different implementation mainly results in a 
rescaling of Cgai, we can only safely interpret the trends in 
the data, but not the values themselves. Our data suggests 
that the concentration of the satellite distribution decreases 



.■ I■ ■ ■ ■ 

10'^ 

■^20o[^70 ^ q \ 


Fig. 15. Parameters describing the dark matter distribu¬ 
tion (horizontal axis) versus parameters describing the dis¬ 
tribution of satellites (vertical axis) for the different redshift 
samples, (top left): halo mass versus concentration of the 
satellite distribution; (top right): dark matter concentration 
versus concentration of the satellite distribution, with the 
one-to-one relation as solid black line; (bottom left): halo 
mass versus scaling of the surface mass density to the num¬ 
ber density of satellites. 


with mass. If we fit a linear relation between log2o(Af2oo) 
and Cgai, we obtain a non-zero slope of —10.7±2.3. Secondly, 
Cgai and C200 are correlated. A correlation between C200 and 
Cgai is expected, given that red-sequence galaxies trace the 
dark matter. Figure dUI suggests that this is not the result of 
a degeneracy between the parameters. We quantify this cor¬ 
relation with the Pearson coefficient, which has a value of 
0.67j'jg ]4. The error on the Pearson coefficient corresponds 
to the 68% confidence intervals, obtained from 10 000 ran¬ 
dom realisations of the data, in which we draw new con¬ 
centration values from a Gaussian whose mean equals the 
best-fit value and whose width corresponds to the observed 
error. 

Finally, our measurements show that Agai, the scaling 
between the model surface mass density and the number 
density of red-sequence members brighter than M* + 1, 
increases with mass. Since the concentration of the satel¬ 
lite distribution is allowed to differ from that of the dark 
matter, this cannot be directly interpreted as a change in 
scaling between the surface mass density of dark matter 
and the projected density of red-sequence cluster mem¬ 
bers brighter than M* + 1. Agai merely serves as a nui¬ 
sance parameter. For completeness, we parametrise this as 
^igai = B + ft X log4Q(M2oo/10^'^) and list the best-fit val¬ 
ues in Table Sj These results are not sensitive to moderate 
changes in the two-halo term. 
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^20o[^70 ^©] 

Fig. 16. Cluster mass versus bias from our halo model 
fits to the leasing and cluster-satellite signals for our clus¬ 
ter sa m ples. The line s show th e results from iTinter et al.l 
dMnll. iSheth etalJ lt20nl . iTinker et al.l (1200511 and 
I Johnston et al.l ( 200711 . as indicated in the figure. 


more constraining power on the amplitude of the two-halo 
term. Since the large-scale lensing signal is generally weak 
and therefore more susceptible to biases due to potentially 
remaining systematics in the shape catalogues, we deem 
that further testing of the robustness of our lensing signal 
is required before we can extend the fitting range. Also the 
cluster-satellite correlation becomes harder to measure re¬ 
liably at larger scales, as it becomes increasingly sensitive 
to the background density. 

We have tested how our results change when we imple¬ 
ment a different prescription for the two-halo term. The 
biases are fairly robust against moderate changes. Only if 
we apply a large boost to the two-halo term, for example by 
replacing the li near power spectrum with the non-linear one 
(as is done in (ICacciato et al.ll2009l : Ivan Uitert et al.ir2011l : 
iLeanthand et al.l 201211 1. the values become ~10% lower. 
Next to that, we note that our halo models on average un¬ 
derpredict the cluster-satellite correlation at 1-2 /i^Q^Mpc, 
which may bias the halo bias high. 

The difference between our bias values and the liter¬ 
ature relations is larger for our intermediate- and high- 
redshift clusters. This may be caused by the same combina¬ 
tion of effects that might explain the high concentrations. If 
our cluster finder preferentially selects overdense clusters, 
the large-scale structure will also be preferentially oriented 
along the line-of-sight. Similarly, if our cluster is actually a 
projection, more additional structure may be present along 
the line-of-sight. Both would lead to a larger projected clus¬ 
tering of matter, and hence a larger bias. 


7. Mass-bias relation 


We show the relation between cluster mass and bias in Fig. 
1161 together with a number of relati ons from the lite r ature , 
including the relation derived in I.Tohnston et al.l (l2007ll 
for th e maxBCG clusters, the prediction from ISheth et al.l 
(IMl based on an ellipsoidal collapse model and cali¬ 
brated on n umerical simulation s, with further refinements 
presented in I Tinker et al.l (l2005ll . and a new f unctio nal form 
for the mass-bias relation from iTinker et all (|201(lll , derived 
from a large set of collisionless numerical simulations based 
on ACDM. For clarity, we only show the relations at a red- 
shi ft of z = 0.27 , exce pt for the most recent one, that is, 
the iTinker et al.l (|201(tl relation; the relation with the low¬ 
est (highest) amplitude is for z = 0.27 {z = 0.67). 

Our bias values increase with redshift as expected (for 
a compilation of redsh ift-dependent bias models, see Fig. 1 
of IClerkin et al.l [201511 . but they are somewhat larger than 
the relations from the literature. Since our clusters do not 
span a wide range of mass, we only constrain the relative 
normalisat ions with respect t(3 t hese relations. Fitting the 
relation of ITinker et al.l l|20in[l (iJohnston et al.l (1200711 1 at 
z = 0.27 to our low-redshift results, we find a relative nor¬ 
malisation of 1.24±0.08 (1.73±0.11). For the intermediate- 
and high-redshift slices, we find a relative normalisation of 
1.45T0.08 and 1.5 1T0.11 with respect to the relation of 
ITinker et al.l (|2010ll at z = 0.46 and z = 0.67, respectively. 
Note that we have again ignored the errors on the mass 
in deriving these relative normalisations, which should not 
matter much as the bias is only a weak function of mass. 

If we use the priors from I Johnston et all (l2007ll and only 
fit the lensing signal, the errors on th e bias blow up since the 
lensing signal at large scales is noisy. IJohnston et al.l (|2007ll 
fit the lensing signal out to 30 h~^ Mpc, which provides 


8. Conclusion 

We present the results from a combined weak-lensing 
and cluster-satellite correlation analysis of ^lO'^ clusters 
in the RCS2. These clusters span a range of 0.2<z<0.8 in 
redshift and have typical masses M2oo>5 x Mq. 

We divide the clusters in three redshift slices and six 
richness bins, and measure the average lensing signal and 
cluster-satellite correlation for each sample. Satellites are 
identified as all red-sequence galaxies at the cluster redshift 
brighter than M* -|-1. We model the signals simultaneously 
using a cluster halo model, in which we account for the 
miscentring distribution and the scatter between richness 
and mass. From these fits, we obtain the masses, the 
concentrations of the dark matter and of the satellite 
distribution, the bias, and the miscentring parameters. 

We parametrise the relation between mass 
and richness as M 200 = A x (77|qq''/ 20)“ and find 
A = (I5.0±0.8) X and a = 0.73 ± 0.07 

for our low-z clusters at 0.2<z<0.35. At intermediate 
redshift (0.35<2<0.55), we find a higher normalisation 
of A = (22.5 ±1.7) X lO^^h^Q Mq. Passive evolution and 
halo mass growth would lead to a higher normalisation at 
lower redshift, opposite to what we find. Hence we expect 
that this trend is driven from a fractional increase of N 200 
towards lower redshift, caused by the build-up of the red 
sequence. Similar trends were observed in the analyses of 
maxBCG clusters l|Becker et al.ll200^ iRvkoff et al.ll2008ll . 

Our measurements provide tight constraints on the 
cluster miscentring distribution. Only ^30% of our BGGs 
are located at the centre of the halo; the remaining 
BGGs follows a 2D-Gaussian, whose width is approxi¬ 
mately 0.2 hjTg^Mpc at 5 X and increases to 
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0.4 hyg^Mpc for our most massive systems. The ratio of 
the width and r20Q is flat with mass, with an average value 
of 0.44 ± 0.01. Our miscentring fraction is higher than 
what is typically reported In the literature, which might 
be caused by the commonly made assumption that the 
central galaxy always resides exactly at the centre of the 
dark matter distribution, and any miscentring is caused by 
not correctly identifying the central galaxy. In reality, even 
the central galaxy may be offset from the centre of the 
halo, as allowed for in our modelling, leading to smaller 
centring fractions. 

The mass-concentration relation of our low-z cluster 
sample agrees well with the predictions from numerical 
simulations, but for our intermediate- and high-redshift 
clusters, a higher normalisation is preferred. We hypothe¬ 
sise that this is the result of two effects: a selection effect, 
as our cluster finder preferably selects overdense systems; 
and a projection effect, where two clusters are located close 
to the line of sight and counted as one. We find further 
evidence for these effects from the constraints on the bias, 
which show a similar trend: fair agreement with numerical 
simulations at low redshift, but a preference for higher 
values at higher redshifts. 

The concentration of the satellite distribution decreases 
towards higher mass. It is correlated with the concentration 
of the dark matter. The corresponding Pearson coefficient 
has a value of 0.67lg;^4. 

We have tested the robustness of our results against 
various changes in our halo model, such as Including 
different implementations of the two-halo terms and the 
use of different priors in the fit. These tests show that 
all but one parameter are robustly extracted. Only Cgai 
changes considerably when the two-halo term is modelled 
differently, and hence their values need to be interpreted 
with care. Since a different implementation mainly leads 
to a rescaling, the decrease of Cgai with mass and the 
correlation between Cgai and C200 are a robust find of this 
work. 

The next step is to repeat our analysis on mock data. 
This is needed for a better characterisation of our cluster 
sample, in terms of cluster completeness, purity, and the 
assignment of the cluster’s centres. Without that, a robust 
cosmological exploitation of our measurements via the 
halo mass function is not possible. In addition, mocks will 
also be very useful to help interpret our current findings, 
for example whether selection and projection effects can 
explain the high concentration and bias values that we 
find, or whether there is another cause. Finally, we plan 
to use our cluster sample with its red-sequence members 
for a variety of follow-up projects, including a range of 
alignment studies. 
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Appendix A: BCG selection 

Both the stacked shear signal of all clusters and the total 
galaxy overdensity signal around the centre given by the 
cluster finder, reveal that the clusters have a broad miscen- 
tring distribution. In principle, if our cluster halo model is 
sufficiently flexible, our results should not critically depend 
on the choice of the cluster centre. However, given that the 
cluster miscentring distribution is somewhat uncertain, it 
is better to try to optimise the definition of the centre. 

To improve the centring of the clusters we attempt to 
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Fig. A.l. Shear signal around BCG candidates selected with the reference ‘BCG-500’ algorithm (top panels). The bottom 
panels show the difference in lensing signal between various BCG selection algorithms and the reference. The left-hand 
column shows the results for all clusters, the middle one for poor clusters and the right-hand panel for rich clusters. The 
error on the difference is simply approximated by the error on the reference shear measurement - the true error will only 
be slightly larger due to the large covariance of the results. The lensing signal around the ‘BCG-250’ is clearly smaller 
at small projected separations, indicating that a higher fraction of these BCGs are miscentred. 


identify the BGG. For this purpose, we use the catalogue of 
red-sequence galaxies that were used to identify the cluster. 
We try a number of different schemes for identifying the 
BGG and compare the stacked shear signal and galaxy 
overdensity signal. The most successful prescription will 
result in a maximal shear and galaxy overdensity signal at 
small scales; miscentring basically shifts power to larger 
projected separations. 

We start with identifying the brightest red-sequence 
candidate in the z-band at the cluster redshift within 
250 h^Q kpc, 500 kpc and 750 kpc from the 
original cluster centre and adopt it as the candidate BGG. 
The total lensing signal and galaxy overdensity around 
these BGG selections are shown in Fig. lA.ll and IA.21 
and are labelled ‘BCG-250’, ‘BGG-500’ and ‘BCG-750’, 
respectively. The shear signal around the ‘BCG-250’ BCGs 
is clearly lower at small scales than for the other options. 
Due to the large errors in the shear signal at small scales, 
there is no obvious difference in the shear signals between 
the ‘BGG-500’ and ‘BGG-750’ selections. However, the 
galaxy overdensity shows that the ‘BGG-500’ BCGs better 
coincides with the peak of the galaxy distribution, and 
hence are better centred. 

A visual inspection of a number of rich clusters reveals 
that in some cases, the brightest galaxy is quite far offset 
from the distribution of cluster members. Hence to improve 
the ‘BGG-500’ and ‘BCG-750’ selections, we compute the 
weighted z-band luminosity centre of the cluster using 
all red-sequence candidates that are located within 500 
hjQ kpc from the original cluster’s centre. If there exists 


a second-bright red-sequence candidate member at the 
cluster redshift that is at most 0.5 magnitude fainter than 
the hrst selected BGG, but that is located at least 100 
kpc closer to the centre of light, we adopt it as the BGG 
and consequently as the new centre of the cluster. This 
occurs in 30% and 34% of the ‘BCG-500’ and ‘BCG-750’ 
selections, respectively. The resulting BCG selections are 
called ‘BCG-500-I-’ and ‘BGG-750-1-’. From Fig. IA.2I we 
observe that the resulting BCG catalogues are indeed bet¬ 
ter centred. The one with the highest galaxy overdensity 
is ‘BCG-500-1-’. We therefore adopt the BGG candidates 
from this algorithm as the centres of the clusters. The 
richness estimates of the clusters are recomputed using the 
new centres. 


Appendix B: Impact of cluster redshift scatter 

The assigned ‘red-sequence’ redshifts of the clusters have a 
certain scatter with respect to their actual redshifts. This 
is caused by intrinsic scatter in the red sequence, uncer¬ 
tainties in the background subtraction, contamination of 
fore- and/or background galaxies, and noise. Redshift scat¬ 
ter affects the lensing analysis in three ways: it biases the 
adopted lensing efficiencies, it causes a radial smoothing of 
the lensing signal, and, if the signal is non-linearly redshift 
dependent, also an additional smoothing of the signal in 
that direction. 

The lensing efficiency does not linearly scale with lens 
redshift. Hence if the actual redshifts are scattered com- 
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Fig. A.2. Fractional galaxy overdensity around BCG candidates selected with the reference ‘BCG-500’ algorithm (top 
panels). The bottom panels show the difference in galaxy overdensity between various BCG selection algorithms and the 
reference. The left-hand column shows the results for all clusters, the middle one for poor clusters only and the right-hand 
panel for rich clusters only. The error on the difference is simply approximated by the error on the reference overdensity 
measurement - the true error will only be slightly larger due to the large covariance of the results. The ‘BCG-500-I-’ 
leads to the highest, most concentrated peak, indicating that these BCGs are closest to the centre of the cluster member 
distribution. 
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Fig. B.2. Impact of redshift scatter on the radial profile of the model signal. 


pared to the adopted ones, the average over these true lens- 
ing efficiencies does not equal the average over the leasing 
efficiencies of the adopted redshifts. We simulate the impact 
as follows. For a given cluster ‘red-sequence’ redshift, we as¬ 
sume that the true redshift distribution follows a Gaussian 
with a certain width (this is identical to assuming that the 
true distribution of redshifts is flat, and the ‘red-sequence’ 
redshifts follow a Gaussian distribution around the true 
value). We draw a large number of true cluster redshifts 
from this Gaussian, and for each we compute AS*™®. If 


the redshift that is drawn is lower than 0.05 or higher than 
1, we disregard it. We average these critical surface mass 
densities, and compare it to the AScrit of the input ‘red- 
sequence’ redshift. We plot the ratio in Fig. IB. II 

The average of the distribution of true AScrit is higher 
than the value of the single ‘red-sequence’ redshift (the one 
we would use). The size of the bias increases at lower red¬ 
shifts and for higher values of the scatter. For our cluster 
sample, the estimated size of redshift scatter is ~0.03. The 
difference between the mean of the true critical surface mass 
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Fig. B.l. Impact of redshift scatter on the lensing efficien¬ 
cies. 


densities and the one we used is smaller than one per cent 
over the range of cluster redshifts considered. This is con¬ 
siderably smaller than the statistical error, and we therefore 
ignore the effect. 

Next we investigate the impact of the radial smoothing 
due to redshift scatter. We only focus on the lensing sig¬ 
nal here; the cluster-satellite clustering should be affected 
in a similar way. We create a typical cluster lensing pro¬ 
file with our halo model for a certain ‘red-sequence’ red¬ 
shift at fixed physical separations. Next we assume that 
the true redshift distribution follows a Gaussian with a cer¬ 
tain width and draw true redshifts from that. For each red¬ 
shift, we compute the signal at the same angular separation 
(but corresponding to different physical separations), and 
do the radial binning assuming the redshift is the adopted 
‘red-sequence’ redshift. Hence we radially smooth the model 
profile. Then we compute the ratio of the average of this 
smoothed shear profile and the input profile, and show it 
in Fig. IB. 21 

In general, the smoothed profile is larger than the profile 
at a fixed ‘red-sequence’ redshift. The difference is largest 
at low redshifts, and increases when the redshift scatter in¬ 
creases. However, the actual ratio obviously depends on the 
model profile that we have assumed. Deriving a correction 
factor in a model independent way is therefore not obvious, 
and we refrain from doing so. It is also not necessary, given 
that the impact on the signal is of order a per cent, smaller 
than our statistical errors. 

Finally, there is the effect of additional smoothing in the 
redshift direction. Given that the scatter is much smaller 
than the redshift bin size used in this work, this effect is 
negligible. 
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